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Patent Application For Dr. James Kaput 

IDENTIFICATION OF DIET-REGULATED DISEASE-ASSOCIATED GENES 

[0001] The present application claims the benefit of United States provisional 

application Serial Number 60/423,104 filed November 1 st 2002, which application is 
incorporated by reference herein. 

[0002] The work presented herein has been partially funded by Illinois-Missouri 

Biotechnology Alliance (a USDA program, 94-34346-1795) and a American Institute for 
Cancer Research (NC93B63) funded CRADA with the National Center for 
Toxicological Research (Protocol E0260301) 

[0003] BACKGROUND 

[0004] Diet plays a major role in disease causation and progression. At its most extreme 

level, starvation and nutritional deficiency disrupts metabolism so as to cause illness and 
death. At a more subtle level, many disease-related genes are regulated at some level by 
nutritional factors. The chemicals present in food (nutrients) may regulate gene 
expression at the transcription or translational level, at the level of post-translational 
modification, degradation or enzymatic activity. Glucose regulation of insulin and 
glucagon is a well-known example of diet-regulated enzyme regulation. The amount of 
a nutrient present in the body influences the equilibrium of metabolic pathways. In one 
very simple form, this can be thought of as a simple mass-action effect where the 
concentration of a metabolite (reactant) on the left hand side of an equation will drive the 

equilibrium to the right hand side, e.g., A+B <r> C+D. If the concentration of reactant A 

increases, all other factors remaining constant, the equilibrium will shift to the right. But 
unlike this simple example, metabolic pathways are often very complex, replete with 
redundancy, and are interrelated with many other metabolic pathways. Therefore, 
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altering the concentration of a single metabolite often has pleioropic effects on highly 
disparate areas of physiology, including disease-related physiology. Hormones and 
other types of cell-signaling molecules, for example, are well-understood to be regulated 
by diet, and are equally well-known to be related to disease. There is also considerable 
evidence that total calorie intake may influence diseases including breast cancers, such 
as breast cancer. 

[0005] Diabetes is a well-known example of a nutrient-regulated disease in which 

nutrient-regulated messengers (insulin and glucagon) play a key role. Obesity is another 
well-known diet related disease in which nutrient-regulated messengers such as leptins, 
adiponectin, and resistin are important control factors. The Center for Disease Control 
has declared obesity and Type 2 diabetes epidemic in the U.S and in the western world. 
These conditions, and many other chronic diseases, are caused by multiple genes 
influenced by many environmental factors. The complexity of these diseases makes 
them difficult to diagnose and treat. Specifically, three key factors may affect the 
diagnosis and treatment of chronic diseases: 



[0006] a) The same disease phenotype may result from disturbance in different 

metabolic pathways 

[0007] b) The genetic makeup of each human differs, causing variation in response to 

the same factors 

[0008] c) Environmental factors, such as diet, influence health and disease development 

[0009] Chronic diseases, including obesity, Alzheimer's, diabetes, cardiovascular 



diseases, and certain cancers (among others), are generally produced by the interplay of 
environmental factors and genetic mechanisms. In addition, different members of the 
population showing clinical symptoms of any given disease can be grouped with each 
group having some unique genes or ESTs that contribute to disease formation. 
Furthermore, subsets of the unique and commonly-distributed genes are regulated 
directly or indirectly by foods chemicals. Genes that are diet-regulated and involved in 
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disease processes can be identified and grouped to provide diagnostic markers and 
targets for drugs. 

[0010] The Problems at Hand 

[0011] a) The same disease may result by disturbance of different pathways in the 

body. A large body of research has shown that a chronic disease may be caused by 
different pathways [1]. For example, tumors from different individuals may appear 
similar, but may be produced by mutations in different genes. Similarly, the clinical 
term "diabetes" encompasses a variety of different symptoms and causes all having to do 
with imbalances in insulin levels and sugar metabolism. Since several biological 
processes (or pathways) affect insulin production or serum glucose levels, changes or 
defects in any of the pathways may produce the same clinical manifestation of the 
disease. Genomics promises to produce the reagents and tests that will identify each 
sub-type making it easier to diagnose and properly treat the cause rather than the effect 
of diseases. The research underlying this disclosure is designed specifically for 
developing such tests and for developing drugs or medical foods formulated to treat the 
disease. 

[0012] b) Environmental factors, such as diet, influence health and disease 

development. Chronic diseases are often influenced by one or more environmental 
factors acting on the genetic expression of an individual [2-5]. Diet is the most 
important environmental factor influencing health because of the consistent exposure to 
the naturally-occurring chemicals in foods. Many humans eat two to three times per 
day. In addition to providing for our energy needs, food contains chemicals that may 
cause changes in the expression of genetic information. Many food chemicals are 
beneficial (e.g., protein, fats, or carbohydrates) when eaten in moderate amounts but may 
contribute to disease development when they are overconsumed or underconsumed. In 
addition, certain chemicals such as minerals, vitamins, and certain dietary fats and 
carbohydrates are required for growth, development, and health. Optimum health, 
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therefore, is obtained with the proper balance of the types and amounts of food 
chemicals. 

[0013] c) The genetic makeup of each human differs causing variation in response 

to the same factors. Identifying healthy diets would be less complicated than it has 
proven to be if all humans had an identical genome. The human genome project found 
that although humans are 99.9% identical at the genetic level [6-8]. But the small 
(0.1%) differences in genomic DNA sequences produce the variation in physical 
appearances (height, weight, skin coloration, hair texture and color, etc) and the 
responses to environmental factors. These variations within genes explains why any 
given drug may be effective for one patient but not another (pharmacogenomics). 
Similarly, individuals differ in their response to the type and amount of dietary 
chemicals. 

[0014] A well-recognized example of this interaction between a dietary chemical and 

genetic makeup is lactose intolerance [9-11]. Lactose in milk products can produce a 
painful digestive condition commonly associated with nausea, cramps, bloating, gas, and 
diarrhea unless the ingested lactose is metabolized by lactase, an enzyme that cleaves 
lactose to glucose and galactose. During most of human evolution, lactase was produced 
only in childhood. A mutation in the control region of the lactase gene that causes 
lactase to be made into adulthood occurred 10,000 to 12,000 years ago and was 
subsequently propagated in certain human populations. Those individuals lacking the 
mutation cannot tolerate milk, a selective disadvantage, as milk is highly nutritious. This 
easily understood yet dramatic example is a model for the more subtle metabolic 
differences that occur with variations in other genes and exposures to other types of 
chemicals. Weight, for example, can be control a lucky collection of normal gene 
variants that metabolize nutrients and avoiding the types and amounts of foods that 
contribute to weight gain. 
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[0015] The Present Solution 

[0016] Four broad approaches are used to identify gene or protein targets for developing 

diagnostics and drugs used to treat diet-associated diseases: (a) identification of genes, 
ESTs, transcripts or proteins that differ in presence, quantity or expression between 
normal and diseased tissues, (b) identification of clinical factors, e.g., cholesterol levels, 
that correlate with disease, and concomitant identification of genes involved in the 
pathway producing those factors, e.g., HMG CoA reductase), (c) epidemiological 
research associating an environmental factor with incidence or severity of disease and 
identification of a gene or protein associated with that disease and/or environmental 
factor, or (d) identification of chromosomal regions associated with disease and choice 
of most likely candidate genes within those regions. 

[0017] a) Identification of DNA/RNA/protein differences between normal and 

disease tissues: Molecular comparisons identify genes or transcripts or proteins whose 
levels are different among diseased vs. normal tissues [12]. Oligonucloetide array 
technologies provide a means to rapidly screen for the expression profiles of thousands 
of genes at one time. The genes identified as being differentially expressed between 
diseased and healthy tissues may be developed into diagnostics or drug targets. The 
risks of relying only upon differences in gene or protein expression to develop 
diagnostics or drugs are that: 

[0018] (i) such differences do not necessarily discriminate between differences that are 

either a cause or an effect of a disease state. 

[0019] (ii) the same disease may be produced by different pathways - a single marker or 

target will rarely identify or be effective against all forms of the disease. 

[0020] (iii) genetic differences confound analyses because any given gene could be 

normally, but differently expressed among individuals. 

[0021] (iv) environmental factors, which are rarely controlled in studies of isolated 

human tissue, might also cause differences in expression. 
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[0022] b) Identification of candidate genes from associations with measured clinical 

factors (e.g., cholesterol levels) that correlate with disease: Cholesterol levels that are 
1.2X the recommended level of 200 mg/dl have a 2 fold greater risk of heart disease 
(American Heart Association). Researchers therefore targeted the control point of the 
cholesterol biosynthetic pathway for development of drugs (statins) to control the level 
of total serum cholesterol. Statins do reduce cholesterol levels and reduce the relative 
risk of coronary heart disease by -25%. However, the absolute reduction in risk in one 
study (as an example) was 8.3% in the placebo group as compared to 6.4% in the 
pravastatin treated group [13] other references are found in ACC/AHA/NHLBI Clinical 
Advisory on the Use and Safety of Statins). 

[0023] Similar strategies of identifying known pathways, known genes, and known 

enzymes as drug targets are used for almost all major diseases including Alzheimer's, 
cancer, diabetes, and obesity. 

[0024] The major drawbacks of this strategy are that: 

[0025] (1) controlling a single aberrant physiological factor (such as total cholesterol) 

does not affect the many other pathways that contribute to the development 
of the disease. 

[0026] (2) genetic differences among individuals are not considered. 

[0027] (3) environmental influences of diet is also not considered. 

[0028] c) Identification of candidate genes the expression of which is correlated 

with environmental factors linked to a disease state by epidemiological evidence: 

Another strategy for identifying candidate genes is to analyze clinical data to make an 
epidemiological association between a gene, and environmental factor, and a disease [2]. 
For example, HMG CoA reductase, cholesterol, and heart disease. 
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[0029] d) Identification of chromosomal regions associated with disease and choice 

of most likely candidate genes within those regions: Genetic methods for identifying 
quantitative trait loci (QTL - A genetic region known to effect phenotypic variation in 
continuously varying traits like skin color, weight, etc.) have been developed over the 
past 15 years. Such methods identify regions of chromosomes encoding one or more 
genes that contribute to the development of a complex disease, e.g., diabetes [14-16]. 
Insulin-dependent diabetes mellitus (IDDM) is an example of the power and limitations 
of this approach. There are about 15 QTL loci, (IDDM1 through NIDDM15) regulating 
insulin, glucose, lipid, or hormone levels that directly contribute to diabetes 
development. Approximately 1 700 QTLs for diabetes, obesity, cancer, and other 
conditions have been determined in laboratory animals (Jacksone Laboratory web site) 
and a smaller subset has been identified in humans. 

Although QTL methodology provides valuable information, identifying all the 
candidate genes within QTLs is difficult because: 

(i) QTLs are specific only for the population (or laboratory animal strains) 
analyzed. 

(ii) The size of the region identified is variable (~1 to 20 cM, equivalent to 1 - 20 
million base pairs) and may encode many genes of known or unknown 
function. 

(iii) A QTL may be specific to environmental conditions under which they were 
determined [17, 18] and, 

(iv) The contribution of a given locus to a disease process varies between -1 and 
-25% [19], and individual effects may be too small to accurately be 
measured with existing biological techniques [16] 

[0035] The limitations of QTL mapping may be circumvented using positional mapping 

and cloning in selected human populations or families where the chronic disease 
develops earlier than normal. Myriad Genetics Corporation has identified two breast 
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cancer genes, BRCA1 and BRCA2, in early onset breast cancer patients using positional 
cloning and sequencing methods. More recently, 5 loci (MODY1 through MODY5) 
causing the autosomal dominant disease, mature onset diabetes of the young (MODY), 
have been discovered by positional cloning and characterized by DNA sequence 
analyses (rev. in [20]). MODY accounts for between 2 and 5 % of Type II diabetes. 
The genes at these loci, hepatocyte nuclear factor 1 alpha (HNF-1 alpha), glucokinase, 
HNF-4 alpha, HNF-1 beta, and insulin promoter factor 1 have yet to be linked to non- 
MOD Y diabetes development [21]. 

[0036] Approximately 20 candidate genes selected for their likely role in diabetes or 

from various NIDDM and obesity QTLs (some of these loci overlap) have been analyzed 
in genetically diverse populations with high incidence of diabetes (Table 1). 



Table 1 

Possible Candidate Genes for Type 2 Diabetes 0 


Mutated 
Gene 


Function 


Effect 


Linked to 


HNF-4 a, 
HNF-1 p 
IPF-1, 
NeuraDl 


Transcription Factors 


^Insulin Secretion 


MODY (maturity-onset 
diabetes in the young) 
human 


HNF-1 a, 


Transcription Factor 


^Insulin Secretion 


MODYOji-Cree diabetes 


Glucokinase 


Glucose Metabolism 


^Insulin Secretion 


MODY 


Calpain-10 


Protease 


Unknown 


Diabetes 2 in Mexican and 
African Americans 
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PPARy 


Transcription Factor 


^Insulin Sensitivity 


Diabetes 2 


Insulin 
Receptor 


Transmits Insulin Signals 
into cells 


^Insulin Secretion 
and Sensitivity 


Human diabetes (rare) 
mouse model 


IRS1 and -2 


Insulin Signaling 


^Insulin Sensitivity 


Mouse Models 


Akt2 


Insulin Signaling 


^Insulin Sensitivity 


Mouse Models 


11-P-HSD 


Glucocorticoid Synthesis 


/Blood Lipid, 
^Insulin Secretion 


Mouse Models 


UCP2 


UTP Synthesis 


^Insulin Secretion 


Mouse Models 


Resistin 


Fat Cell "hormone" 


J> Insulin Sensitivity 


Mouse Studies 


Adiponectin 


Fat Cell "hormone" 


T Insulin Sensitivity 


Mouse, Human Studies 



a Adapted from: [22] 



[0037] As an example, Bell and his associates [21] found a candidate gene at an NIDDM 

QLT that appears to be involved in the development of Type II diabetes. They carried 
out a genome-wide screen for Type II candidate genes in Mexican American families 
and identified a locus named NIDDM 1 . Subsequent positional cloning showed that 
Calpain 10, a cysteine protease, within the NIDDM 1 locus was associated with Type II 
diabetes in individuals from Botnia, a region of Finland, and in Mexican- Americans. 
Positional cloning to identify genes within QTLs is possible when the gene contributes 
significantly to the disease process, as is the case with Calpain 10. Identifying the genes 
or alleles within the other QTL for diabetes or other chronic diseases will require more 
sophisticated strategies. 

[0038] Although alleles of some of the genes show associations with disease 

development, most show no associations in the populations tested. Such results may be 
due to the small number of individuals analyzed and the need to analyze multiple alleles 
and genes with multiple phenotypic characterizations - most studies use single variant 
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analyses. Hence, the identity of the candidate genes in NIDDM and obesity QTLs have 
not been verified. 

[0039] Although a powerful technique, identifying all genes contributing to chronic 

disease development by QTL analyses by itself is prohibitively expensive because of the 
very large population needed to achieve reliable QLT identification, meaning that many 
QTL studies are prone to identifying the wrong genes as being associated with a trait, 
they may miss important loci and genes, and they inherently have problems with 
resolution, due to the size or the QTL fragments. 

[0040] Problems Inherent in the Present Solutions 

[0041] Many of the specific problems have been discussed above. The consequences 

from the current research and pharmaceutical approach of targeting a single gene, 
protein, or enzyme for drug development (assuming that: each patient is genetically 
identical, that a given disease has one physiological defect, and that environmental 
factors are negligible or unimportant) are that chronic diagnosis, is imprecise and 
treatments of disease are often ineffective. Health care costs, from insurance through 
delivery, are increasing exponentially while the quality of life in Western societies is 
static at best. Genomic research in academia and the pharmaceutical industry promises 
to alleviate severity of chronic diseases through the development of better diagnostics 
and medicines. However, the strategies and drug development pipelines fail to consider 
or address the interplay of environmental factors and genotype. The consequences are: 



[0042] Medicines that, on average, are effective in only 6 of 10 individuals. 

[0043] Trial and error approaches to treating chronic diseases increase health care costs. 

[0044] Increased incidence severity of chronic diseases and resulting personal and 

societal costs. 

[0045] BRIEF DESCRIPTION OF THE INVENTION 
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[0046] A method for identifying diet-regulated disease-associated polynucleotides by 

comparing gene expression between two inbred genotypes (strains) in response to 
different diets and identifying those differentially expressed polynucleotides that overlap 
with independently-derived diet-regulated QTLs. Methods are also disclosed for use in 
disease screening, monitoring and treatment, and for formulating medical foods. Arrays 
are also disclosed. Various newly identified diet-regulated disease-associated genes and 
ESTs are also disclosed, as are compositions of medical foods and dietary supplements. 



[0047] BRIEF DESCRIPTION OF THE DRAWINGS 

[0048] Fig. 1 : Genes Regulated by Diet. Previously unknown disease/diet associations 

for these genes are shown. 

[0049] Fig. 2: Genes Regulated by Diet Depending Upon Genotype. This table shows 

previously undisclosed data. 

[0050] Fig. 3: Genes regulated by diet, genotype, or genotype-diet interactions mapping 

to QTLs. This table shows previously undisclosed data. 

[0051] Fig. 4: Diet-regulated Genes and Disease or Complex Phenotype Processes 

(alphabetical by function). This table shows data known at the time of filing. 

[0052] DETAILED DESCRIPTION OF THE INVENTION 

[0053] A method for identifying diet-regulated disease-associated polynucleotides by 



doing the following: Two different inbred genotypes (known genotypes) are selected (A 
and B). One of these genotypes (A) is more susceptible to a disease (can be any 
"undesirable" phenotype), and the other genotype (B) is less susceptible to the same 
disease. Then each genotype is divided into two groups (Al and A2 and Bl and B2). 
For one genotype, each group is fed a different diet (Al is fed diet No.l and A2 is fed 
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diet No.2, and similarly for Bl and B2). Gene expression is then compared across the 
strains that differ in either genotype or in diet, but not in both. I.e., Al is compared with 
A2; Al is compared with Bl ; but Al is not compared with B2. This is because we want 
to deal with only one variable at a time. Differential gene expression is identified 
between the compared groups, and genes are identified that show significant changes in 
expression (e.g., a 1.5 or 2.0 or 2.5 -fold increase or decrease in gene expression. These 
genes are diet-regulated disease-associated genes. As a further step, these identified 
genes are then compared against independently-identified diet-regulated and/or disease 
associated QTL's. This step helps add assurance to the identification and to help 
differentiate cause from effect for genes that are differentially expressed in response to 
diet. 

More specifically, the method of the invention may be carried out as follows: a) 
comparing gene expression between two inbred strains in response to different diets, 
wherein one inbred strain is susceptible to a disease and the other inbred strain is not 
susceptible to the disease, b) identifying those differentially expressed polynucleotides 
that overlap with independently-derived diet-regulated QTLs, and c) analyzing the data 
to identify diet-regulated disease-associated polynucleotides. Generally the disease in 
question is a diet-associated disease. Gene expression is usually compared by comparing 
mRNA abundance (for example using a cDNA array), but may be compared by looking 
at protein levels. Often in-bred strains of mice are used mRNA abundance is compared 
between strains in response to different diets. Lastly, genes (or EST's or any 
polynucleotides) that have been identified as being significantly differentially expressed 
are compared with previously-identified, independently-derived diet-regulated QTLs. 
Various methods are also disclosed for use in disease screening, monitoring and 
treatment. The disclosed methods may also be used for formulating medical foods used 
to treat and prevent disease and slow disease progression. Arrays are also disclosed that 
employ one or more genes/polynucleotides identified by the method of the invention. 
Various new diet-regulated disease-associated genes and ESTs are also disclosed, as are 
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compositions of medical foods and dietary supplements that have use for 
prophylactically treating populations and individuals susceptible to disease, or 
therapeutically treating populations and individuals who have disease. 

[0055] Using the method of the invention, gene expression comparisons are made within 

a strain based upon differences in diet, and between strains fed the same diets. We can 
therefore identify genes regulated by diet in one or both strains, genes regulated by the 
one or both genotypes, and genes regulated by the interaction between diet and 
genotype. That is, certain genes will be regulated without regard to diet - the regulation 
of these genes will depend upon genetic makeup only. Other genes will be regulated in 
the same manner in all individuals in the population by what is eaten, although the level 
of the response may differ among individuals. The regulation of other genes will depend 
upon an individual's genetic makeup and how it responds to dietary variables. The 
regulation of these genes therefore differs between individuals even if they eat the same 
concentrations of dietary chemicals. 

[0056] Since gene expression differences result from differences in DNA sequence 

(0.1% difference among humans), the methods of the invention can be practiced by 
associating genetic differences in the identified genes with disease incidence, severity, or 
progression. That is, single nucleotide polymorphisms (SNP) or other polymorphisms in 
the identified gene sequences can be used to identify the variants of diet-regulated genes 
that are associated with disease or that predict the severity of the disease once diagnosed. 
The method identifies genes whose abundance (or regulation) is affected by diet. 
However, a logical and obvious extension is that differences in protein or enzyme 
activities of these diet-regulated genes are also likely to influence disease development 
or severity. Analyzing SNPs or other polymorphisms in the promoter and gene may 
therefore be used in place of expression profiling. 

[0057] Various embodiments of the invention include the following: 
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[0058] A method for determining the susceptibility of an individual to a disease, wherein 

said disease involves a diet-regulated disease-associated polynucleotide, the method 
comprising: screening an individual for the presence and/or expression of a plurality of 
polynucleotides identified by the method above or by associated polymorphisms in gene 
sequence, wherein the pattern of expression of polymorphisms of and in said plurality of 
polynucleotides corresponds with the susceptibility of an individual to a certain disease. 

[0059] A method for monitoring the progression of a disease in a subject, the method 

comprising: at a first date, screening an individual for the presence and/or expression of 
a plurality of polynucleotides identified by the method above; at a second date re- 
screening the individual for the expression of the same plurality of polynucleotides, 
wherein a change in polynucleotide expression corresponds with the desirable or 
undesirable progression of a disease. 

[0060] A method for treating a subject so as to reduce the risk of the individual 

developing a diet-associated disease, the method comprising: screening an individual for 
the presence and/or expression of a plurality of polynucleotides identified by the method 
above, wherein the pattern of expression of said plurality of polynucleotides corresponds 
with the susceptibility of an individual to a certain disease; and altering the expression of 
one or more diet-regulated disease-associated polynucleotides to reduce the risk of the 
subject developing the disease. 

[0061] A method for treating a subject so as to reduce the risk of the individual 

developing a diet-associated disease, the method comprising: screening an individual for 
the presence and/or expression of a plurality of polynucleotides identified by the method 
above, wherein the pattern of expression of said plurality of polynucleotides corresponds 
with the susceptibility of an individual to a certain disease, and altering the diet of the 
individual so as to reduce the risk of the subject developing the disease. 
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[0062] . A method for treating a subject so as to ameliorate a diet-associated disease, the 
method comprising: screening an individual for the presence and/or expression of a 
plurality of polynucleotides identified by the method above, wherein the pattern of 
expression of said plurality of polynucleotides corresponds with the susceptibility of an 
individual to a certain disease; and altering the expression of one or more diet-regulated 
disease-associated polynucleotides so as to affect an improvement in the progression of 
the disease. 

[0063] A method for treating a subject so as to reduce the progression of a diet- 

associated disease, the method comprising: screening an individual for the presence 
and/or expression of a plurality of polynucleotides identified by the method above, 
wherein the pattern of expression of said plurality of polynucleotides corresponds with 
the susceptibility of an individual to a certain disease, and altering the diet of the 
individual so as to affect an improvement in the progression of the disease. 

[0064] A method for identifying the suitability of various drugs or medical food 

regimens for a subject diagnosed with a disease, the method comprising: screening an 
individual for the presence polymorphisms regulated by diet, using the above methods. 

[0065] A method for identifying genetic susceptibility of a subject to a chronic disease 

so as to select appropriate drug(s) or diets for reducing the incidence, severity, or 
progression of the disease or symptoms of the disease, the method comprising: screening 
an individual for the presence polymorphisms in genes regulated by diet, genotype, or 
their interactions using the above methods. 

[0066] In all cases, the method claims the use of individual or combinations of diet-, 

genotype-, or diet/genotype-regulated genes for diagnostics and/or development of 
drugs. 
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[0067] The strategy described for identifying genes participating in disease processes 

uses both gene expression analyses in inbred strains of mice (susceptible to disease and 
not susceptible to disease) and publicly available QTL information [4, 5, 23-26]. The 
strategy consists of a multi-step procedure for identifying nutrient-regulated genes 
involved in disease development in laboratory animals differing in genetic susceptibility 
to diet-induced disorders. Altering a diet in genetically-susceptible individuals may be a 
way to slow the initiation and/or change disease progression. 

[0068] Identification of Diet-Regulated Disease-Associated ESTs and Genes 

[0069] A multi-step procedure was developed to identify genes regulated by dietary 

chemicals that participate in disease development. 
[0070] 1 . Analyze mRNA abundance in inbred strains of mice in response to different 

diets using cDNA arrays (variations include measuring protein abundance). One strain 

of mice used is susceptible to a disease and the other strain is not be susceptible. Diets 

are chosen to induce chronic diseases such as obesity or diabetes. 

[0071] 2. Compare expression profiles between inbred strains of mice that differ in 

susceptibility to diet-induced disease. Differences in mRNA abundance identify genes 
regulated by genotype, diet, and their interactions. Since the diet is chosen to induce 
disease development in susceptible genotypes, a subset of these genes will be involved 
in disease development. 

[0072] 3. Differentiate between cause from effect genes by determining the map position 

of diet-regulated to independently-derived QTLs. Differentially expressed genes that 
overlap QTLs become candidate genes for the disease (an example of association 
mapping [16]). 
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[0073] 4. Characterize the expression and activity of a subset of the genes and proteins 

that were identified by expression array technology and QTL analyses in animal models 
of the disease 

[0074] 5. Identify polymorphisms in candidate disease genes 

[0075] 6. Examine their associations in humans who are healthy vs. those showing 

symptoms of the disease. 

[0076] Detailed Methodology 

[0077] Animals, diets and protocols, Male or virgin female (eliminates complications 



and effects of pregnancy) mice of defined genotype are fed a semi-purified diet 
containing 4% corn oil for 1 wk and then randomly assigned to control or experimental 
diets for at least 2 wks and up to the normal lifespan of the mouse. Each diet contains 
1 .4% of the respective total oil content as soybean oil to assure adequate fatty acid 
content (NRC 1995, [27]). An example of the diets is shown in Table 2. The diets are 
formulated according to modified AIN-76 guidelines (NRC 1995,[27]) ? were pelleted, 
and color coded by a commercial vendor. The diets are balanced for minerals, vitamins, 
fiber, and protein, and differ in carbohydrate and lipid level. In some cases, individual 
chemicals (natural or man-made) can be added to the diet to determine their effect on 
gene expression. 

[0078] Mice are caged and fed individually with free access to food and distilled water 

in temperature-controlled rooms maintained at 23 ± 1°C with a 12-h lightdark cycle. 
Animal care meets National Institutes of Health guidelines. Food spillage is also 
monitored throughout the course of the experiment. Efficiencies of energy utilization are 
calculated from the recorded weekly weight gain/calculated weekly energy intake. 
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At the end of the feeding period, all mice were deprived of food for 12 h and 
offered a preweighed 3-g pellet of their assigned diet. After 2 h, the uneaten food was 
removed; at a defined time after, all mice were injected intramuscularly with 0.02 mL/g 
body weight of ketaset/xylazine mixture (Ketaset, Fort Dodge Laboratories, Ft. Dodge, 
IA) for collection of blood via cardiac puncture. Immediately thereafter, they are killed 
by cervical dislocation and their livers and hearts removed, individually frozen in liquid 
nitrogen, and stored at 80°C for mRNA isolation. Altering the length of time for 
depriving food, for length of feeding, and for time to collection would all be trivial and 
obvious modifications of this protocol. 
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Table 2. Diet composition (exemplary) 



Ingredient 


Corn 




Coconut 






4% 


20% 


4% 


20% 












Corn oil 


2.8 


18.6 


0 


0 


Coconut oil 


0 


0 


2.8 


18.6 


Soybean oil 


1.4 


1.4 


1.4 


1.4 


Maltodextrin 


11.8 


14.1 


11.8 


14.1 


Cornstarch 


45.0 


19.3 


45.0 


19.3 


Sucrose 


9.5 


11.2 


9.5 


11.2 


Casein, alcohol extracted 


19.0 


22.6 


19.0 


22.6 


L-Cystine 


0.3 


0.3 


0.3 


0.3 


Cellulose 


4.7 


5.6 


4.7 


5.6 


Mineral mix 1 


1.0 


1.1 


1.0 


1.1 


Calcium phosphate 


1.2 


1.5 


1.2 


1.5 


Calcium carbonate 


0.6 


0.7 


0.6 


0.7 


Potassium citrate, monohydrate 


1.6 


1.9 


1.6 


1.9 


Vitamin mix 1 


1.0 


1.1 


1.0 


1.1 


Choline bitartrate 


0.2 


0.2 


0.2 


0.2 


Total grams 


100.1 


94.2 


100.1 


94.2 


kJ/g 2 


15.9 


19.2 


15.9 


19.2 


Protein, kJ% 


20.0 


20.0 


20.0 


20.0 


Lipid, kJ% 


10.0 


40.0 


10.0 


40.0 


Carbohydrate, kJ% 


70.0 


40.0 


70.0 


40.0 


Minerals, mg/kJ 


2.7 


2.6 


2.7 


2.6 


Vitamins, mg/kJ 


0.6 


0.6 


0.6 


0.6 


Cellulose, mg/kJ 


3.0 


2.9 


3.0 


2.9 



1 AIN-76A diet recommendations (National Research Council 1995). 

2 Amount of energy is 16.7 kJ/g for protein and carbohydrate, 37.6 kJ/g fat. 
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[0080J Molecular technologies. All molecular technologies, such as RNA isolation, 

RNA analyses on DNA or microarrays, or proteomic analyses, are standard. 

[0081] One of the important unique features of the present invention is that gene 

expression comparisons are made within a strain based upon differences in diet, and 
between strains fed the same diets. We can therefore identify genes regulated by diet in 
one or both strains, genes regulated by the one or both genotypes, and genes regulated by 
the interaction between diet and genotype. Our proof of principle data demonstrated that 
we identify genes regulated by each of these factors (strain-specific, diet-specific, 
genotype / diet interaction-specific, and genotype-specific). To our knowledge, no other 
experimental paradigm is capable of this type of analyses. Our strategy can also 
determine whether known candidate disease genes are regulated by dietary factors. 

[0082] EXAMPLE 

[0083] Our most complete gene expression study was designed to identify mammary 

tissue genes regulated by caloric restriction and their role in mammary cancer 
development. There is considerable evidence accumulating that caloric restriction 
reduces the incidence and severity of several diseases, including breast cancer. The 
strains used were obese yellow A vy /A and agouti A/a that are segregants (i.e., 
littermates) of a (BALB/cSTCrlfC3H/Nctr x VYWffC3Hf/Nctr- A vy mating. The A vy 
mutation is caused by insertion of an intracisternal A particle genome in the promoter of 
the agouti (A) gene. It might be considered one of the first (albeit naturally occurring) 
transgenic mouse strains. The 131 amino acid long agouti gene product is ectopically 
expressed resulting in yellow coat color, obesity, hyperinsulinemia, and increased risk of 
spontaneous and chemically induced mammary cancer. Since A vy /A become obese 
(~60g v ~25g for A/a mice) and develop symptoms of Type II diabetes, we used cDNA 
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arrays to analyze genes regulated in liver in response to genotype, diet (100% vs 70% 
calories), and the interaction between diet and genotype. 

[0084] Experimental details: Fifty-two mice of each strain fed ad libitum and 52 mice 

of each genotype fed restricted diets (70% calories) in the control group. Mice were fed 
a defined diet (not shown but available upon request) for 30d, 60d, 90d, and 1 yr - with 
5 mice per group. Another set of 394 animals were used for a mammary carcinogenesis 
experiment. Our specific feeding regimen was used during the last day of the 
experimental period to ensure that all animals were in a fed state at death. Liver, 
mammary, pancreas, kidney, heart, colon, and intestine were removed from each mouse 
and are available for gene expression analyses. Phenotypic characterizations were 
conducted (e.g., food intake, body weights) since they may be correlated with expression 
of one or more genes. mRNA from livers were isolated, converted to radioactive cDNA, 
and used to probe for Genome Systems membranes spotted with -18,000 mouse genes. 

Liver Gene Regulation Results: Gene regulation was compared in duplicate 
between (read across columns): 



Genotype/Calories vs Genotype/Calories Regulation by: 

Obese (A^/A) - 1 00% vs Obese (A^/A) - 70% Diet 

Normal (A/a) - 1 00% vs Normal (A/a) - 70% Diet 

Obese ( A^/A) - 1 00% vs Normal (A/a) - 1 00% Genotype 

Obese (A^/A) - 70% vs Normal (A/a) - 70% Genotype 



[0085] For each pair, expression levels of 1 8,000 different genes were compared. The 

comparisons were done in duplicate. Approximately 600 genes changed expression 
levels (mRNA) greater than or equal to 2.5 fold (up-regulated) or greater than or equal 
to 1/2.5 fold* (down-regulated) between diets or genotypes in the replicate experiments 
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Thus the abundance of particular mRNAs was 2.5 times more in one condition and/or 
genotype combination relative to another condition and/or genotype. The inventors 
chose a factor of 2.5 as a threshold in the change in gene expression (mRNA) to 
determine a significant change in expression, but the method may be performed using 
any number of different thresholds, depending on the sensitivity and specificity required. 
The threshold increase in gene expression to be used to determine a significant change, 
and therefore a potentially important target gene/EST may be determined by routine 
experimentation by one of skill in the art, and may, for example, be lx, 1 .5x, 2x, 2.5x, 
3x, 3.5x, 4x, 4.5x, 5x, 6x, 7x, lOx, 15x, 20x, 30x, 50x or more. 

[0086] Of the 601 genes, 376 have been previously identified. The genes were sorted 

into functional groups (Table 2). 

[0087] *(4 groups were used: Al, A2, Bl, B2. The following comparisons were done to create ratios: (1) A1/A2 

and A2/A1; B1/B2 and B2/B1; Al/Bl and Bl/Al; A2/B2 and B2/A2. All of these ratios had to be equal to 2.5 or 
greater than 2.5 to be ncluded for the gene to be considered statistically significant. A short form for writing all of 
these combinations is to use the reciprocal for the second comparison; which in the present example is the reciprocal 
of 2.5 which is 1/2.5 (instead of saying A1/A2 and A2/A1 just say A1/A2 and its reciprocal). Mathematically the 
relationship is statement is: 1/2.5 = < A1/A2 >= 2.5.) 

[0088] Table 2. Functions of Diet- and Genotype-Regulated Genes 



Function 


Number 


Structural Proteins 


71 


Metabolic Enzymes 


82 


Proteases/Inhibitors 


19 


RNA interacting proteins 


12 


Signal Transduction/Transcription Factors 


98 


Unknown Function 


41 
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[0089] The map position of each of the 601 diet-regulated genes was determined using 

publicly-available chromosomal maps at the Jackson Laboratory and compared to 
independently-derived quantitative trait loci. 1 1 7 of the diet- and genotype-regulated 
genes mapped to diabetes, obesity, and/or growth QTLs (some of these loci overlapped). 
Of these 117 genes, 41 were involved in signal transduction and/or transcriptional 
regulation processes (Table 3). All or a subset of these genes will be used as diagnostics 
for type 2 diabetes and obesity, and a subset are candidate drug targets. 32 genes were 
identified as being associated with diabetes and are shown in the dada set. 

[0090] Genes Regulated by Genotype 

[0091] The sum total of gene expression of all genotype-regulated genes contributes to 

physiological differences between A vy /A and A/a mice. Fifteen (15) genes were 
regulated at 2.5 fold (or 1 / 2.5 fold) difference by genotype regardless of caloric intake. 
Several notable examples illustrate how genotype alters expression of key genes and 
therefore phenotype: 

[0092] Snrk, SNF-1 related kinase was more abundant in A/a relative to A^/A. SNF1 is 

involved in chromatin remodeling by modifying histone H3. 
[0093] DNA methyltransferase {Dnmtl) mRNA was more abundant in A^/A relative to 

A/a mice . In general, methylation marks transcriptionally inactive DNA and imprinting. 

Regulation of abundance of Dnmtl mRNA by diet may be one of the underlying 

mechanisms that increases the susceptibility to hepatocellular hyperplasia and neoplasms 

in A^/a mice. 

[0094] SLC22A 7, or the organic anion transporter (Oat), transports glutarate and 

prostaglandin E2. It is an imprinted gene. PGE2 has been shown to suppress expression 
of fatty acid synthase, 1-pyruvate kinase, and the S 14 protein mRNA in liver. 
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[0095] Diet-regulated genes 

[0096] Genes differentially regulated by calorie intake may play a role in the initiation 

or increased severity of the disease. Akt2 (PKB), a protein kinase is regulated by diet in 
the nonsusceptible genotype and the lack of similar regulation in the susceptible 
genotype may contribute to disease development. Akt2 has been proposed as a signal 
from the insulin receptor to insulin response elements of target genes. Mice lacking Akt2 
are hyperglycemic, hyperinsulinemic, and their livers produce glucose in the presence of 
insulin. 

[0097] Gene products with differential abundance in each genotype may also alter 

energy metabolism. 

[0098] Tpo, peroxidasin (aka, thyroid peroxidase, deiodinase), was diet regulated in the 

nonsusceptible genotype. The thyroid prohormone T4 is converted to active T3 or 
inactive reverse T3 in peripheral tissues by members of the peroxidasin (deiodinase) 
family of genes. A variant in human deiodinase gene is associated with insulin 
resistance. 

[0099] Mknkl is stress-activated map kinase interacting protein 1 . The stress-activated 

kinase pathway is implicated in regulating genes involved in energy metabolism. 

[00100] Hipk2, homeo domain interacting kinase, is a component of a corepressor 
complex regulating the homeobox gene product NK3. The other members of this 
complex are Groucho (a corepressor protein) and histone deacetylase, suggesting that 
Hipk2 plays a role in chromatin remodeling and transcriptional regulation. 

[00101] Rbpl, retinol binding protein, is the serum carrier of retinol. Retinol metabolites 
are transcriptional regulators that play key roles in energy metabolism and are linked to 
development of diabetes and obesity. 



24 



SUBSTITUTE SPECIFICATION Kaput-001 US 



[00102] Genotype + Diet Interactions Genes 

[00103] The majority of gene products (1 70 out of 376) were regulated by genotype plus 
diet interactions: that is, they were regulated differently depending upon combinations 
of genetic makeup (genotype) and dietary intake. 

[00104] Genes Mapping to Diabetes QTL 

[00105] The map positions of the 376 genes of known function were determined and 
compared to quantitative trait loci (QTL) associated with various subphenotypes of 
diabetes and obesity ("diabesity") (Figure 3). Thirty-two (32) of the diet-, genotype-, 
and diet + genotype-regulated genes in liver mapped to diabesity QTL (Figure 3). Genes 
mapping to weight gain or obesity QTLs were also determined (Figure 3). An interval 
distance of +/- 10 cM from a given QTL was used, a distance consistent with the marker 
density employed in most QTL association studies. 

[00106] Several of the gene products regulated by diet, genotype, or their interactions are 

associated with diabetes and/or are in pathways that alter phenotypes consistent with one 
of the conditions of diabetes (Table 3, Diabesity Association). Genes mapping to 
diabesity QTL and differently regulated are likely to cause differences in diabetes 
subphenotypes: 

[00107] Camk2b (0.5 cM) maps near a non-insulin dependent diabetes {Nidd4ri) QTL (2 
cM) on chromosome 1 1 . This gene product is a key regulator of insulin secretion by the 
pancreas but is also expressed in liver. The allele at Nidd4n plays a role in regulating 
insulimglucose ratios. 

[00108] Reccl, the large subunit of replication factor C is the ATP-dependent primer 
binding subunit of DNA polymerase, was found in the large complex with other DNA 
repair genes called BASC - the BRCA1 associated genome surveillance complex. 
Reccl (39 cM) maps near Dbsty2, a QTL associated with plasma glucose levels and 
several obesity and body weight QTLs (ObqJ2, Bw8, Bwob) on chromosome 5 (29 - 44 
cM). 
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[00109] Several other genes had more complex regulatory patterns but may play a role in 
causing differences in subphenotypes of diabesity: 

[00110] Mapkl (9.82 cM) maps near tally ho associated non-insulin dependent diabetes 
mellitus {Tanidd3, 5cM) on chromosome 16. In adipocytes, Mapkl regulates Foxc2, a 
transcription factor involved in regulating insulin responsiveness. An allele at Tanidd2 
is linked to hyperglycemia. 

[00111] Sftbl, a gene encoding subunit 1 of the splicing factor 3B, maps on chromosome 
1 (at 28.9 cM) near Dbstyl (21 cM), a locus associated with body weight, plasma insulin 
and plasma glucose levels. Several genes involved in insulin metabolism, including the 
insulin receptor and attractin (the receptor of the agouti protein) are alternatively spliced 
producing variants with differential influences on insulin metabolism. Leiter's group 
has demonstrated that Dbstyl interacts epistatically with Dbsty loci on chromosomes 
5,12, and 15. 

[00112] NeuroD is a transcription factor involved in insulin gene regulation in the 

pancreas. Mutations in this gene result in mature onset diabetes of the young {NeuroD = 
MODY6). NeuroD (46 cM) maps near a Type 2 diabetes mellitus (T2dm3) QTL (42.7) 
on chromosome 2 and 4 obesity or weight gain QTL (Mob7, Mob6, Obq3, Bw6). The 
allele at T2dm3 is linked with fasting plasma insulin levels. 

[00113] Several other diet-regulated genes produce proteins that are considered markers 
of diabetes. For example, fibronectin and albumin are increased in serum of diabetics 
and their glycation state correlates with the severity of the disease. Enzymes such as 
glutaminase and histidine ammonia-lyase may play a role in serum pH balance through 
the production of ammonia. Glycerol 3 -phosphate acyl transferase may alter 
triacylglycerol stores, a process which is nutritionally regulated and responsive to 
insulin. 
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[00114] Some individual genes have not been studied for a role in processes affected by 
diabetes, but members of their functional family have been linked to processes that are 
altered in that disease. Other genes and their products mapping to diabesity QTL have 
only a tentative association to conditions in diabetes - aquaporin, tricarboxylate 
transport protein, ankryin, metaloproteases, and debrins are examples although 
explanations can be developed for each of these potential candidates. Nevertheless, 
these . genes are candidates for diabetes subphenotypes by virtue of their regulation by 
genotype or diet and their map position near QTL associated with diabetes symptoms. 

[00115] The method and the newly identified diet-regulated disease-associated genes 
disclosed herein, may be used in a method to determine the susceptibility of an 
individual to a disease, wherein the disease involves one or more diet-regulated disease- 
associated polynucleotide identified by the method of the invention. The new method 
and the newly identified diet-regulated disease-associated genes may be used in a 
method for monitoring the progression of a disease by screening the individual for the 
presence and/or expression of a plurality of polynucleotides, and at a second date re- 
screening the individual for the expression of the same plurality of polynucleotides, 
wherein a change in polynucleotide expression corresponds to the desirable or 
undesirable progression of a disease. The invention includes a method for treating a 
subject so as to reduce the risk of the individual developing a diet-associated disease, by 
screening the individual and altering the expression of one or more diet-regulated 
disease-associated polynucleotides, or by altering diet, for example by feeding the 
individual particular amounts of combinations of nutrients to reduce the risk of the 
subject developing the disease. 

[00116] Additional embodiments include arrays (Microarrays, cDNA arrays) for 

detecting gene expression in subjects, wherein the genes are genes identified by the 
methods disclosed herein. The invention encompasses arrays having at least one 
nucleotide (probe) that corresponds to and/or binds specifically to at least one 

27 



SUBSTITUTE SPECIFICATION Kaput-001 US 



polynucleotide that has been identified by the disclosed methods as a diet-regulated 
disease-associated gene. Such an array may contain a plurality of such probes or cDNAs 
or polynucleotides, for example it may contain at least 5, 10, 20, 40, 60 or at least 100 
such probes. Arrays are well known and need not be discussed here, except to say that 
arrays made with combinations of probes identified by the methods of the invention may 
be used to screen individuals and populations for susceptibility to a disease and to 
monitor disease progression. 

[00117] Additionally, antibody probes and arrays may equally be used to detect protein 
expression and to screen for the presence and amount of proteins expressed by diet- 
regulated disease-associated genes. 

[00118] The methods disclosed herein, using arrays and other tools, may be used to 

screen populations to determine the presence and frequency of various diet-regulated 
disease-associated genes in a population or humans or animals. This information may be 
used to formulate foods ("medical foods" or "nutraceuticals") that provide enhanced 
health benefits to various individuals and populations. For example, in-bred breeds of 
dog, say Labradors, may be analyzed using the methods of the invention, and 
susceptibility to various diseases, such as diabetes, may be determined. In this case, pet- 
foods may be formulated for that particular breed of dog to provide particular health 
benefits and to address this breed's particular needs. The same methods of food 
formulation may be done for any animals including farm animals, pet animals, and 
humans. 

[00119] Genes Mapping to Obesity QTL 

[00120] Genes mapping to obesity loci may contribute to subphenotypes of obesity. 
Other QTL for adiposity at various sites and total carcass lipid levels (for a 
comprehensive review see Jackson Laboratory website) were not included in our 
analyses since these parameters were not measured in this study. Many diabesity QTL 
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overlap obesity QTL (Figure 3) as would be expected for the diabesity phenotype. 
Associations with many specific molecular pathways influencing or involved in obesity 
and/or weight gain can be made for each of the other diet-, genotype-, and diet + 
genotype-regulated genes mapping to obesity and weight gain QTLs (Figure 3). 
[00121] Diabesity is a complex trait resulting from interactions between multiple genes 
and environmental factors. In humans, chronic exposure to excessive calories, 
deficiencies of micronutrients, and certain types of macronutrients induce obesity and 
diabetes in individuals, presumably without deleterious mutations in participating genes. 
These diseases therefore fit the common variant / common disease hypothesis proposed 
by Lander and Collins and colleagues. We have proposed that one or more of the gene 
products participating in development of chronic diseases will be regulated at least in 
part by diet since different macronutrients and excess calories are associated with almost 
all chronic diseases. 

[00122] Although chronic diseases are multigenic in nature, much information regarding 
the pathways involved in disease development has been discovered by the study of 
rodent models with single gene defects or induced mutations (knockouts and 
transgenics) that mimic diabetes and/or obesity. Comprehensive reviews of the mouse 
models for insulin resistance and obesity have recently been published. The general 
conclusion from these reviews echoes the conclusion of Wolff that similar if not 
identical phenotypic expressions of a disease state can be reached by different metabolic 
routes. That is, alterations in many pathways can produce the same phenotype. 
Nevertheless, these genes and their variants may identify sets of pathways that 
collectively produce the specific diabetes subphenotypes and obesity pattern in one 
model of obesity and diabetes. That is, some of these pathways may also be involved in 
other models of diabesity if gene variants in the key regulatory or structural genes 
collectively produce expression changes similar to those observed in this specific model. 
Genes identified in the A^/A and A/a comparison may contribute to obesity or diabesity 
in humans if their regulation is altered in a similar manner. Although we and others 
identify candidate disease genes through gene expression analyses, changes in the 
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activity of proteins encoded by gene specific mRNAs may also be associated with 
disease development. Polymorphisms within the coding sequences of candidate genes 
(cSNPs) that alter their function also contribute to disease development [54]. 

[00123] We found 601 hepatic genes or ESTs regulated in the same manner in replicate 
experiments with 376 genes having a known function. We used a cutoff ratio of 2.5 fold 
or greater or 1 / 2.5 fold or less to define regulation because of hybridization conditions 
and type of DNA array. Genes of all functional classes and types of regulation were 
differently expressed in this genotype comparison. 

[00124] Quantitative trait loci are used to associate chromosomal regions with complex 
traits. There are now over 1 700 QTL for disease, subphenotypes of disease, enzyme or 
protein levels, behavior, and other complex traits in mice (see Jackson Laboratory 
website). The limitations of using QTL data are that (i) they may be specific to the 
inbred strains analyzed, (ii) identify 20 - 30 cM regions of DNA, and (iii) often can not 
detect interactions with other loci. In addition, few mapping studies rigorously control 
or report diets; environment is known to have a large influence on the identification of 
QTL affecting complex traits, at least in plants. Our approach combines the strength of 
array technology with the power of genetics to identify potential causative genes. A key 
additional component of our approach is the rigorous control of diet composition and a 
timed feeding regimen that will allow for replication of the experiments. 

[00125] Even with limitations of the current experiment (type of array, number of mice, 

single tissue source), the data presented herein identify potential novel candidate genes 
in many different functional pathways that may play a role in expression of 
subphenotypes of diabesity. Several of the genes that were found to be diet-regulated 
and mapped to diabetes QTL had previously been linked to specific pathways affected 
by or involved in diabetes. NeuroD, Camk2b, and Rgs2 are linked to Type 2 diabetes or 
are in pathways directly regulating insulin function. Fibronectin and albumin are used to 
monitor glycation levels in diabetics; however, their regulation and map position within 
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diabesity loci suggest that they may play a more active role in evolution of the chronic 
disease state. Two genes which produce ammonia also mapped to QTL suggesting a 
potential mechanism for acid:base balance and alterations in nitrogen metabolism in 
diabetes. Other genes mapping to QTL from our screen can be associated with various 
alterations in metabolism found in diabesity. They become candidates for further 
testing. 

[00126] Since obesity is a more "amorphous" phenotype, with adiposity, weight gain, and 

overall weight as the key phenotypic markers, it is more difficult to compare candidates 
identified in this screen with those found in other model organisms or humans. Genes 
analyzed in this screen that are regulated by diet, genotype, and genotype X diet that 
map to obesity QTL may be considered candidates for obesity development or severity. 

[00127] The strategy described is a means to identify diet-, genotype-, and genotype X 
diet-regulated genes that cause or promote the development and severity of complex 
phenotypes. This approach can be applied to comparisons of each mutant model and its 
normal inbred parent or strain and to congenic siblings produced specifically for 
separating and combining QTL producing a complex phenotype. By comparing across 
the different genotypes fed the same diet, genotype-regulated genes can be identified. 
Similarly, by feeding two or more diets to mice with different genotypes, diet-regulated- 
and diet + genotype-regulated genes can be identified. 

[00128] Understanding diabetes and obesity will require integration of knowledge from 
individual pathways that have been elucidated to date. However, inclusion of diet as a 
variable in a systems biology approach will also be necessary to fully explain complex 
phenotypes, almost all of which are influenced by environment, and specifically by 
dietary variables. This type of scientific study is called nutrigenomics or nutritional 
genomics. Knowledge of the interactions of diet and genotype will be needed when 
testing and treating these diseases in human populations. 
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[00129] Although various examples are given in this disclosure, they are for illustrative 
purpose only and in no way limit the scope of the invention. 
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